One Step Pr Vs Multi-Agent Reinforcement Learning: A Shape